Options for Modelling Temporal Statistical Dependencies in an Acoustic Model for ASR

نویسندگان

  • Volker Leutnant
  • Reinhold Haeb-Umbach
چکیده

In this paper we consider the combination of hidden Markov models based on Gaussian mixture densities (GMM-HMM) and linear dynamic models (LDM) as the acoustic model for automatic speech recognition systems. In doing so, the individual strengths of both models, i.e. the modelling of long-term temporal dependencies by the GMM-HMM and the direct modelling of statistical dependencies between consecutive feature vectors by the LDM, are exploited. Phone classification experiments conducted on the TIMIT database indicate the prospective use of this approach in continuous speech recognition.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

N-gram Based Modelling of Time and Feature Dependencies in HMM

Developing new acoustical models that overcome HMM based modelling restrictions is an active field of research in automatic speech recognition. In this paper two approaches are presented. The first one consists on exhaustively modelling the dependencies between the acoustic features used to parametrize the speech signal, improving the traditional use of dynamic features. N-gram based modelling ...

متن کامل

NATIONAL UNIVERSITY OF SINGAPORE School of Computing PH.D DEFENCE - PUBLIC SEMINAR

Automatic Speech Recognition (ASR) has been one of the most popular research areas in computer science. Many state-of-the-art ASR systems still use the Hidden Markov Model (HMM) for acoustic modelling due to its efficient training and decoding. HMM state output probability of an observation is assumed to be independent of the other states and the surrounding observations. Since temporal correla...

متن کامل

Modelling of Resonance Frequency of MEMS Corrugated Diaphragm for Capacitive Acoustic Sensors (TECHNICAL NOTE)

In this paper, a new model for resonance frequency of clamped circular corrugated diaphragm has been presented. First, an analytical analyzes has been carried out to derive mathematic expressions for mechanical sensitivity of diaphragm with residual stress. Next by using Rayleigh's method we present mathematical model to calculate the resonance frequency of corrugated diaphragm and investigate ...

متن کامل

On the Use of Augmented Hmm Models for Overcoming Time and Parameter Independence Assumptions in Asr

There is significant interest in developing new acoustic models for speech recognition that overcome traditional HMM restrictions. In this work, we propose to use Ngram based augmented HMMs. Two approaches are presented. The first one consists on overcoming the parameter independence assumption. This is achieved by modeling the dependence between the different acoustic parameters, using N-gram ...

متن کامل

Evaluation of Pronunciation Variants in the ASR Lexicon for Different Speaking Styles

One of the challenges in automatic speech recognition is how to handle pronunciation variation. The main causes for pronunciation variation are the speaker (voice characteristics, accent, non-nativeness etc.) and the speaking style (reading, spontaneous responses, conversation etc.). An ASR system has basically two options for modelling the variation on the word and sub-word level: lexical mode...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010